Detecting AI-Generated Text
Learn how AI-generated text is detected and how to avoid this detection.
How is AI-generated text detected?
Various experimental tools have been developed to detect AI-generated text and assess its quality. These tools look for statistical patterns that distinguish machine-generated text from natural human writing. One such tool, the Giant Language Model Test Room (GLTR), released by researchers at Harvard and the MIT-IBM Watson AI Lab, scans text and highlights each word according to how predictable it is to a language model.
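To make the highlighting idea concrete, here is a minimal sketch rather than GLTR's actual implementation. GLTR's demo ranks each token among a language model's predictions for that position and colors it by rank bucket (top 10, top 100, top 1,000, or beyond). The sketch below assumes a plain word-frequency table as a stand-in for a real context-aware model, and the function names (build_ranks, bucket, highlight) are hypothetical.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def build_ranks(reference_text):
    """Map each word to its frequency rank in the reference text (1 = most common).

    Assumption: a frequency table stands in for a real language model,
    which would rank candidates per position using the preceding context.
    """
    counts = Counter(tokenize(reference_text))
    return {
        word: rank
        for rank, (word, _count) in enumerate(counts.most_common(), start=1)
    }


def bucket(rank, thresholds=(10, 100, 1000)):
    """Turn a rank into a color label; None means the word was never seen."""
    labels = ("green", "yellow", "red")
    if rank is not None:
        for limit, label in zip(thresholds, labels):
            if rank <= limit:
                return label
    return "purple"


def highlight(text, ranks, thresholds=(10, 100, 1000)):
    """Pair every token in the text with its color label."""
    return [(tok, bucket(ranks.get(tok), thresholds)) for tok in tokenize(text)]


if __name__ == "__main__":
    ranks = build_ranks("the the the cat cat dog")
    # Tiny thresholds so the toy vocabulary spreads across all four buckets.
    print(highlight("the cat dog bird", ranks, thresholds=(1, 2, 3)))
```

Text dominated by "green" tokens is text the model finds highly predictable, which is the kind of pattern a reader of the highlighted output would look for.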
Another tool, GPTZero, developed by Edward Tian at Princeton, estimates the likelihood that a piece of content was generated by ChatGPT by analyzing its perplexity (how random, or unpredictable, the text looks to a language model) and its burstiness (how much that unpredictability varies across the text). OpenAI has also introduced a tool called OpenAI AI Text Classifier, which rates how likely it is that a text longer than 1,000 characters was AI-written.
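The two measures can be illustrated with a small sketch. This is not GPTZero's implementation, whose details are not given here. It assumes a toy add-one-smoothed unigram model in place of a real language model, defines perplexity as the exponential of the average negative log-probability per token, and treats burstiness as the standard deviation of per-sentence perplexity. All function names are hypothetical.

```python
import math
import re
from collections import Counter
from statistics import pstdev


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def train_unigram(reference_text):
    """Return a word-probability function from add-one-smoothed unigram counts.

    Assumption: a toy unigram model stands in for a real language model.
    """
    counts = Counter(tokenize(reference_text))
    total = sum(counts.values())
    vocab_size = len(counts) + 1  # one extra slot shared by all unseen words

    def prob(word):
        return (counts[word] + 1) / (total + vocab_size)

    return prob


def perplexity(sentence, prob):
    """exp of the average negative log-probability per token."""
    tokens = tokenize(sentence)
    if not tokens:
        raise ValueError("sentence has no tokens")
    avg_nll = -sum(math.log(prob(tok)) for tok in tokens) / len(tokens)
    return math.exp(avg_nll)


def burstiness(text, prob):
    """Population standard deviation of per-sentence perplexity."""
    sentences = [s for s in re.split(r"[.!?]+", text) if tokenize(s)]
    return pstdev([perplexity(s, prob) for s in sentences])


if __name__ == "__main__":
    prob = train_unigram("the cat sat on the mat. the dog sat on the rug.")
    print(perplexity("the cat sat", prob))         # familiar words: lower value
    print(perplexity("quantum flux zebra", prob))  # unseen words: higher value
    print(burstiness("the cat sat. quantum flux zebra.", prob))
```

Under this sketch, text whose sentences are all similarly predictable scores low on both measures, which detectors of this kind generally read as a hint of machine generation.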
However, these tools have limitations: they can produce false positives, and they are effective primarily on English text. It's also important to note that AI text generators like ChatGPT have limitations of their own and occasionally produce unreliable or hallucinated content.
Testing GPTZero
Let's test GPTZero on some content we generate using ChatGPT.
Prompt: Please write a paragraph on bioinformatics.
Bioinformatics is a multidisciplinary field that combines biology, computer science, and statistics to analyze and interpret biological data. It involves the development and application of computational methods, algorithms, and tools to understand complex biological systems at the molecular level. In bioinformatics, large volumes of biological data, such as DNA sequences, protein structures, and gene expression profiles, are processed, analyzed, and modeled to extract meaningful insights. These insights help in understanding biological processes, identifying genetic variations, predicting protein structures and functions, and investigating disease mechanisms. Bioinformatics plays a crucial role in genomics, proteomics, drug discovery, personalized medicine, and other areas of biological research. By harnessing the power of computational tools and techniques, bioinformatics enables researchers to unravel the mysteries of life and make significant advancements in various fields related to health, agriculture, and environmental sciences.
Here are the results when we run this content through GPTZero:
Here are the stats in more detail:
We can see that GPTZero correctly identifies the content as AI-generated and reports scores for the two measures it evaluates: perplexity and burstiness.